Relating possible world based and POMDP based characterization of sensing actions
نویسنده
چکیده
In this paper we discuss two diierent ways to characterize sensing actions: one based on possible worlds and another based on POMDPs. In the past there has been a possible world based characterization when sensors and actuators are perfect, and a POMDP based characterization allowing them to be imperfect. We develop a simpliied POMDP based characterization for the rst case and a possible world based characterization for the second case. We then show the equivalence of the possible world based characterizations and POMDP based characterizations and discuss some of the other characterizations in the literature
منابع مشابه
Two Heuristics for Solving POMDPs Having a Delayed Need to Observe
A common heuristic for solving Partial ly Observable Markov Decision Problems POMDPs is to rst solve the underlying Markov Decision Process MDP and then con struct a POMDP policy by performing a xed depth lookahead search in the POMDP and evaluating the leaf nodes using the MDP value function A problem with this approximation is that it does not account for the need to choose actions in order t...
متن کاملApplication of Partially Observable Markov Decision Processes to Robot Navigation in a Minefield
We consider the problem of a robotic sensing system navigating in a minefield, with the goal of detecting potential mines at low false alarm rates. Two types of sensors are used, namely, electromagnetic induction (EMI) and groundpenetrating radar (GPR). A partially observable Markov decision process (POMDP) is used as the decision framework for the minefield problem. The POMDP model is trained ...
متن کاملReasoning about Sensing Actions and Reactivity
This thesis focuses on the problem of reasoning about sensing actions and the relationship between action theory and reactive control. The first approach to reasoning about sensing actions is due to Moore1. He used the possible world semantics to represent knowledge and treated the accessible relation between world models as a fluent. Later, Hass proposed another way of formulating sensing acti...
متن کاملExploiting Fully Observable and Deterministic Structures in Goal POMDPs
When parts of the states in a goal POMDP are fully observable and some actions are deterministic it is possible to take advantage of these properties to efficiently generate approximate solutions. Actions that deterministically affect the fully observable component of the world state can be abstracted away and combined into macro actions, permitting a planner to converge more quickly. This proc...
متن کاملApproximation Algorithms for Solving Cost Observable Markov Decision Processes
Partial observability is a result of noisy or imperfect sensors that are not able to reveal the real state of the world. Problems that suffer from partial observability have been modelled as Partially Observable Markov Decision Processes (POMDPs). They have been studied by researchers in Operations Research and Artificial Intelligence for the past 30 years. Nevertheless, solving for the optimal...
متن کامل